Information capacity of optical fiber channels with zero average dispersion 
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We study the statistics of optical data transmission in a noisy nonlinear fiber channel with a weak 
dispersion management and zero average dispersion. Applying path integral methods we have found 
exactly the probability density functions of channel output both for a non-linear noisy channel and 
for a linear channel with additive and multiplicative noise. We have obtained analytically a lower 
bound estimate for the Shannon capacity of considered nonlinear fiber channel. 
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Introduction The classical theorem of information the- 
ory states that the capacity of a power-constrained 
transmission in an additive Gaussian noise channel 
growths logarithmically with increase of the signal to 
noise ratio (SNR). Thus, an improvement of the capacity 
(maximum average information per symbol that can be 
transmitted through the channel) in such systems can 
be achieved by increase of the signal power assuming 
that the noise level is not affected. The Gaussian statis- 
tics of noise is a fundamental assumption in derivation 
of this widely known Shannon's result. Properties and 
applications of bandlimited linear channels with addi- 
tive white Gaussian noise (AWGN) form a foundation 
of modern information theory. It should be emphasized 
that the AWGN linear channel model is not just a sim- 
ple mathematical construction, but is applied directly to 
many practical problems such as, for instance, deep-space 
communication. However, in some applications, nonlin- 
ear response of a transmission medium must be taken 
into account. Evidently, properties of nonlinear informa- 
tion channel can be significantly different from that for 
AWGN models. Interaction of noise and signal in non- 
linear transmission channel can result in non-Gaussian 
statistics of received signals. The theory of non-Gaussian 
information channels though being an evident challenge 
for many decades is not yet well established compared to 
the success of AWGN models. Studies in this fundamen- 
tal research area are additionally motivated by practical 
technological achievements and growing demand for ef- 
ficient high-speed, high quality communications. Recent 
progress in fiber optics attracts much fresh interest to the 
information theory of non-Gaussian nonlinear communi- 
cation channels |2|-|2|- Optical fiber waveguides made of 
silica present low loss, ultra-high capacity, cost-efficient 
transmission media with many attractive features. Us- 
ing optical amplifiers to recover signal power simultane- 
ously at many carrier frequencies (channels) within the 
fiber bandwidth it is possible to transmit optical infor- 
mation data over thousands of kilometres. It is already 
well recognized, however, that the nonlinear response of 
the transmission medium (Kerr effect nonlinearity) plays 



a crucial role in limiting the aggregate capacity of opti- 
cal fiber systems. Accumulation of nonlinear interactions 
with propagation along the transmission line makes fiber 
information channels essentially nonlinear. Evidently, 
nonlinear impairments (or in other words, a level of sig- 
nal corruption due to nonlinearity) depend on the signal 
power. Therefore, in nonlinear channels an increase of 
the signal power would not necessarily improve the sys- 
tem capacity. Recently, in their pioneering work Mitra 
and Stark suggested that from the information theory 
perspective under certain conditions one can treat essen- 
tially nonlinear noisy channels as linear ones with effec- 
tive multiplicative noise Q ■ Applying this idea to multi- 
channel optical fiber transmission systems they derived 
a heuristic linear model with multiplicative noise that 
presumably approximates some features of the original 
nonlinear channel. Though a connection between statis- 
tical properties of such an effective " nonlinear noise" and 
system/signal characteristics is still a subject of further 
research and justification, this intuitive approach outlines 
a possible way to treat nonlinear transmission channels. 
In order to compute the Shannon capacity it is neces- 
sary to make one more step beyond determination of a 
conditional probability. Namely, one has to find the opti- 
mal input signal statistics (that is even more complicated 
functional problem). The channel capacity C defined by 
Shannon is a maximum of the following functional (called 
mutual information) with respect to the statistics of in- 
put signal X, given by distribution function p(X): 
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P(X) 



J VXVY P(X,Y) log 2 
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Here P{X,Y) = P(Y\X)p(X) is the joint distribu- 
tion function of input X and output Y] P ut(Y) = 
J T>XP(Y \X)p(X), and all specific properties of a com- 
munication channel are given by the conditional prob- 
ability P(Y\X). To the best of our knowledge the only 
case for which there exists an explicit analytical solution 
of the corresponding functional optimization problem is 
when the joint distribution of input and output signals 
are Gaussian. In this case the Shannon capacity can be 
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explicitly expressed through an input-output pair corre- 
lation matrix introduced by Pinsker [8(. Difficulties in 
the analysis of non-Gaussian nonlinear channels to some 
extent are caused by a relatively limited number of ap- 
propriate mathematical methods. Therefore, to practi- 
cally estimate the capacity of the nonlinear fiber channel 
most authors 0-0 a PPly the Pinsker formula that for 
the Gaussian noise coincides with Shannon's definition, 
but as a matter of fact gives only the lower bound on 
the capacity Q ■ An interesting open problem is capabil- 
ity of the Pinsker formula to mimic the true behavior of 
the capacity of nonlinear information channels especially 
in the case of large input signal power. Computation of 
Shannon capacity for any realistic optical amplifier trans- 
mission system is a very complicated problem which is 
unlikely to be solved analytically. Therefore, it is of cru- 
cial importance for further progress in this area to find 
basic simplified models of fiber nonlinear channels that 
can be treated analytically. Such solvable models can 
provide guidance to analysis of much more complicated 
general problems in the information theory of nonlinear 
fiber channels. 

In this Letter we present a theoretical analysis of a 
physical model which describes the transmission of light 
signals in a noisy nonlinear fiber channel with zero aver- 
age dispersion. To examine the similarity and difference 
between the effects of nonlinearity and multiplicative 
noise, in parallel, we study a linear model of the channel 
with both additive and multiplicative noise. We calcu- 
late analytically the probability density function (PDF) 
of the channel output for both models. Using our derived 
conditional probabilities we analyze the capacity of cor- 
responding transmission systems. We compare here two 
approaches to the estimation of system capacity: first, 
based on Pinsker's formula for input-output correlation 
matrix and, second, directly applying Shannon's defini- 
tion of the capacity. 

The average propagation of a complex light envelope 
E(z, t) in a noisy optical fibre line with the so-called weak 
dispersion management (see for details e.g. 0, 0) in 
the main order is described by the stochastic nonlinear 
Schrodinger equation: 
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Here n(z, t) is an additive complex white noise with zero 
mean and correlation function (see for notations |9|) 

< n(z,t)n*(z',t') >=< n > 5{z - z')5{t - t'). (3) 

In the present Letter we restrict consideration to the case 
of weakly dispersion-managed fiber systems with zero av- 
erage dispersion < d >= 0. The propagation equation 
then is effectively reduced to the Langevin equation for 
the regularized field u(z) = E(z,0) with the regularized 
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77, u{z = 0) = Uq, 



(4) 



noise n(z) = n(z,0). 

du 

dz 

Here rj{z) is a white noise with zero mean and correlation 
function < n(z)i]*(z') >= DS(z - z'), where D = 2W < 
no > is the regularized noise intensity. To restore the 
capacity for a bandwidth limited signal one has simply 
to multiply all the corresponding results by the channel 
bandwidth W . 

Calculation of a conditional probability Some statisti- 
cal properties of system (0J includin g hi gher-order mo- 
menta have been studied by Mecozzi However, the 
method used in did not permit explicit computation 
of the PDF which is required in the analysis of system ca- 
pacity. Therefore, to calculate the conditional probabil- 
ity P(u, z\u ) we apply here the so-called Martin-Siggia- 
Rose formalism [l2j that presents the conditional PDF 
of the output as the following functional integral: 



q[Z \ =U -Jdz'C[ q (z')} 

P(u,z\u ) = Vqe , 

q(0)=u 



(5) 



where the effective Lagrangian is defined as C[q] = 
(2D)^ 1 \q'—i\q\ 2 q\ 2 . Integral I© can be calculated analyt- 
ically. The substitution q(z) — q(z) exp[i J Q Z dz'\q(z')\ 2 } 
brings Lagrangian to its free form. The Jacobian of this 
transform is unity and in the new variables the integral 
becomes Gaussian. After simple straightforward algebra 
it can be reduced to 
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where the auxiliary "partition function" is 



P (r, (j}',z\r ,(j)o) = J Vqe 

q(0)=r o e ; *0 



(here u = re 1 ^ 
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roe 1 ^ ). The effective action de- 



composes into sum of the classical part and a fluctuating 
part that does not depend on the limits. The fluctuating 
field is calculated by expanding over the complete set of 
eigenfunctions of the operator —d 2 + satisfying zero 
boundary conditions at z' = and z' = z. Omitting de- 
tails of these operations we present a final expression for 
the conditional probability of our nonlinear channel: 
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sinh fcr, 
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here q m = k m rro/(Dsmh(k m z)), k m = \j2imD and Ii m i 
is the modified Bessel function. 

Next we establish an analogy between the considered 
nonlinear channel (NLCH) and a linear channel with mul- 
tiplicative noise (LMNCH): 

u' — ivu = r], u(z = 0) = uq, (9) 

< r)*{z')r](z) >=D6(z-z'), (10) 

< v{z)v(z') >= D'S(z - z') (11) 

Applying a similar procedure to above we derive the con- 
ditional probability function of the form Eq.ljHJ with re- 
placement P m — * P m where 



P m (r, z\r ) 



1 -m 2 D'2/2 T f rr ° 
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Dz 



(12) 



Note that if the information is transmited using only 
signal power (the so-called intensity modulatuion - di- 
rect detection systems) r = \u\ the conditional probabil- 
ity takes the form (after integration in polar coordinates 
(r, 4>) over phase 0: J d(f>P(u, z\u ) = P (r, z\ro))- 

P (r,z\r ) = P (r,z\r ) = ^- I (^) . (13) 

Dz \zDJ 



Note that in both cases (nonlinear and effective multi- 
plicative noise channels) formulae (JSJ) and l|12fl yield the 
same result. 

Channel capacity First we revise the procedure com- 
monly used in the recent literature for the channel ca- 
pacity estimation. We demonstrate here that the consid- 
eration based on pair correlation functions Q can lead 
to results very different from the Shannon capacity and, 
therefore, should be used with caution. Some authors 
0.0 instead of using the original Shannon definition cal- 
culate capacity by exploiting a simpler Pinsker formula 
based on a complex self-conjugate input-output correla- 
tion matrix Cap: 
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K u a u* fj > (14) 



Here indices a, (3 — input, output; and brackets stand 
for the average over noise (rj for non-linear problem and 
r\ and v for double noise model) and over statistics of 
the input signal uq (which is assumed to be Gaussian). 
Defined in this way the Gaussian capacity Cg coincides 
with the Shannon capacity for the case of Gaussian joint 
input-output distributions which corresponds to the lin- 
ear channel with additive noise 8]. For nonlinear chan- 
nels or channels with multiplicative noise the Gaussian 
capacity (|14|) represents the lower estimate for the true 
Shannon capacity C (see 0)- 

We start from the calculation of the correlation matrix. 
To perform noise averages we use either PDF © or (|12H . 



It is easy to find that Cj, 



= S , C n 



(\u(z)\ 2 ^ = {S + N), N = 2D z regardless the model. 

However, the cross-correlations Ci n ,out = (uou*(z)) are 
different 
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where k x = V2iD. Note that SNR = S/N = s changes 
only due to variation of S, while N = 2Dz is fixed as we 
consider here a fixed transmission distance. Substitution 
of the correlation matrix into the definition Eq. (|14|l yields 
the final result 



Cg = log 2 
where 



1 



a(z) 



b(z) 



(1 + s) a(z)\l + sb(z))\ 4 
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(16) 



(17) 



(18) 



It is seen from Eq. I|16(l that with increase of SNR Cg de- 
cays to zero for the case of the nonlinear channel (simular 
to conclusions made in 0,@) and tends to a constant for 
the case of the multiplicative noise channel. However, 
below we will show that in both cases the true Shannon 
capacity C is unbounded and grows logarithmically with 
increase of S/N similar to the linear channel. 

Direct estimate of the Shannon capacity Following 
Shannon we consider now the channel capacity C de- 
fined as a maximum of the mutual information with re- 
spect to the statistics of input, uq, given by distribution 
function p(uq) under the fixed average input power S: 



C = max / d 2 ud 2 uoP(u, uq) log 2 

p(«o) J 



P(u,u ) 



Pout{u)p(U ) 

The conditional probability P(u\uq) connecting output 
and input probabilities: P ou t(u) — J d 2 uoP(u\uo)p(uo) 
is given either by JSJ) or (|12fl . Note that the Shannon 
definition allows one to obtain directly an estimate of 
capacity. Any arbitrary trial distribution p(uo) provides 
for a certain low boundary estimate of the capacity C. 
The closer a trial function is to the optimal distribution of 
p(uq) the better is our approximation of the true capacity. 
Applying the so-called Klein inequality for two arbitrary 
probability distribution functions P and V 



d 2 ud 2 uo-P(w, uq) log 2 



P(u,u ) 



> 



V(u,u ) 

we obtain the following chain of inequalities: 

P(u,u ) 



(20) 



C > 



> 



d ud mq P(u, uq) log 2 



J d 2 ud 2 uo P(u, uq) log 2 



Pout(u)p(u ) 

P{u,u ) 

Pout(u)p(u ) 



(21) 
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where V is an arbitrary PDF (by this we mean that it is 
non-negative and normalized) and p(uo) is an arbitrary 
(not optimal) initial signal distribution. Next we exploit 
an arbitrariness of V and p(u ) in by chosingp(uo) = 
(2 7 r)- 1 p(r ), V(u, u ) = (2tt)- 2 P {r\r ) p(r ). Here we 
assumed that both an input distribution p(uo) and V are 
phase independent and Pq(t\tq) is the radial conditional 
probability given by Eq. fT3|l . Substitution of these trial 
functions into inequality Ea. l|21(l brings it to the form 



C> / drdr rr P (r\r )p(r )log 2 ■ 



Po{r\r ) 



J dr'r'P (r\r')p(r')' 
(22) 

Evaluation of the r.h.s. of this inequality for any trial 
function leads to an estimate of a lower bound for the 
Shannon capacity. Substituting P(r, z\ro) from (|13fl . and 
considering a Gaussian trial function for input statistics 



p(r ) = (2/5)exp(- 
tain: 



i"o/S), after simple algebra we ob- 



C > C (s) = ln(l + s)-2s + F 1 (s) 
Fi(s) = s- 1 



(23) 



da; xKq (x yl + s 1 )Io (x) In Iq (x) 



where Io and Ko are modified Bessel functions and s = 
S/N is the SNR. Then the main contribution from the in- 
tegral Fi(s) to the asymptotic behavior of Cq(s) for large 
s comes from the region x ^> 1. Using the asymptotic 
expansion of modified Bessel functions we get 



C> - In s 
~ 2 



0(1). 



This proves that Cq and hence the Shannon capacity C 
are both unbounded as S/N — > oo. 

Our result in particular shows that a naive straight- 
forward application of the Pinsker formula for evaluation 
of the capacity of a nonlinear channel as, for instance, in 
0, can lead to wrong conclusions regarding the asymp- 
totic behavior of the capacity with S/N — ► oo. Note 
that for the specific problem considered here it is pos- 
sible to modify the definition of C a p to obtain correct 
asymptotics for capacity using input-output correlation 
matrix. Indeed, calculation of Cq constructed with cor- 
relators <rg) = S, (r) = S + N, (rr ) = S F 2 (s), 
F 2 (s) = (2s 2 )' 1 J °° dxx 2 l (x) K (xVl + s" 1 ), leads to 
C G = fn(l + s)/(l + s [1 - F 2 (s)]). Taking into account 
that F 2 (s — > oo) — > 1 one can see that it gives the correct 
asymptotic behavior for capacity. Unfortunately, there is 
no general recipe for choosing the correct correlators in 
the Pinsker formula. 

Discussion and conclusions We have examined the 
statistics of optical data transmission in a noisy nonlin- 
ear fiber channel with a weak dispersion management 
and zero average dispersion. We have also studied simi- 
larity and difference between effects of nonlinearity and 



multiplicative noise, considering in parallel a linear chan- 
nel with multiplicative (and additive) noise. Using an- 
alytically calculated conditional PDF we analyzed the 
Shannon transmission capacity for both models. We did 
manage to find analytically a lower bound estimate for 
the Shannon capacity of the nonlinear fiber channel con- 
sidered here. We revise the Pinsker formula which has 
been used without justification in some recent works and 
show that the Gaussian capacity defined through the pair 
correlation functions should be used with caution in the 
case of nonlinear transmission channels. To incorporate 
the optimization procedure inherent for the Shannon def- 
inition one needs to elaborate all possible correlators and 
find those which are essential, i. e. are much greater 
than others. Those correlators may then be used in the 
Pinsker formula to provide a simple and tractable expres- 
sion for the channel capacity. That would not be neces- 
sary if the Shannon definition could be worked out. Un- 
fortunately, it is hardly the case for any more or less prac- 
tical problem of interest. Another important result of our 
analysis is that nonlinearity and multiplicative noise do 
not necessarily degrade input-output correlations in the 
same way. Therefore, relating the nonlinear problem to 
a linear one with multiplicative noise has to be carefully 
justified for each specific transmisison system model. 
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